Position and time sensitive closed captioning

ABSTRACT

Closed captioning information is provided regarding the location of a speaker, and when the text is spoken. An audio/video signal includes a video data and the closed captioning information. The closed captioning information includes a time index, a closed captioning text, and positioning information. The positioning information indicates a position within a frame of the video data, and is associated with the closed captioning text for a given time index. The position corresponds to the speaker who is speaking the associated closed captioning text. A decoder device decodes the closed captioning information to determine the position of the speaker within the video data, and the time index to correlate the closed captioning text and positioning information to a specific frame of video data. The video data is preferably scaled to provide a less than full screen video. The scaled video is appropriately positioned on a display screen and talk bubbles, which provide a visual link between the closed captioning text and the speaker, are preferably displayed off the scaled video. Alternatively, the video is not scaled and the talk bubbles are superimposed on the full screen video in a blended fashion.

FIELD OF THE INVENTION

The present invention relates to the field of closed captioning. Moreparticularly, the present invention relates to the field of position andtime sensitive closed captioning.

BACKGROUND OF THE INVENTION

Closed captions are a text version of the spoken part of a television,movie, or computer presentation. Closed captioning was developed to aidhearing-impaired people, but it is useful for a variety of situations.For example, captions can be read when audio can not be heard, eitherbecause of a noisy environment, such as an airport, or because of anenvironment that must be kept quiet, such as a hospital.

Closed captioning information is encoded within a video signal, in line21 of the vertical blanking interval (VBI). The text only becomesvisible with the use of a decoder, which may be built into a televisionset or available as a set top box. In general, an onscreen menu on newertelevisions allows you to turn closed captioning on or off.

Most programs are captioned in advance of transmission, but the natureof some programs, such as live sports events, requires real timecaptioning. For real time captioning, a stenographer listens to thebroadcast and types a shorthand version into a program that convertsshorthand into captions and adds that data to the television signal.

The closed captioning standard is defined by the Electronic IndustriesAlliance in EIA-708-B, “Digital Television (DTV) Closed Captioning.”EIA-708-B makes it possible for users to select the size, color, andfont of their captions and to select among multiple streams, choosing,for example, a particular language.

The typical use of closed captioning is a text box directly overlaid onthe video. This is not preferable since the text box blocks a portion ofthe video. Further, the closed captioning normally provides a visualtranscription of the dialog. Often it is difficult to determine who wasspeaking, or in other words, to whom does the visual transcription inthe closed captioning text box apply.

SUMMARY OF THE INVENTION

Embodiments of the present invention provide enhanced closed captioningfunctionality Closed captioning information is provided regarding thelocation of a speaker, and when the text is spoken. An audio/videosignal includes video data and the closed captioning information. Theclosed captioning information includes a time index, a closed captioningtext, and positioning information. The positioning information indicatesa position within a frame of the video data, and is associated with theclosed captioning text for a given time index. The position correspondsto the speaker, or actor, who is speaking the associated closedcaptioning text. The decoder device decodes the closed captioninginformation to determine the position of the speaker within the videodata, and the time index to correlate the closed captioning text andpositioning information to a specific frame of video data. The videodata is preferably scaled to provide a less than full screen video. Thescaled video is appropriately positioned on a display screen and talkbubbles, which provide a visual link between the closed captioning textand the speaker, are preferably displayed off the scaled video. In analternative embodiment, the video is not scaled and the talk bubbles aresuperimposed or overlaid on the full screen video in a blended fashion.Blending the talk bubbles is preferably performed using alpha blendedgraphics.

In one aspect of the present invention, a method of closed captioningincludes receiving an audio/video signal including a video data and aclosed captioning data, determining a position of a closed captioningtext according to position information included within the closedcaptioning data, wherein the closed captioning text is associated to aspeaker within the video data and the position of the closed captioningtext is proximate the associated speaker, and displaying the video dataon a display screen and displaying the closed captioning text on thedisplay screen according to the determined position. The closedcaptioning text can overlay the displayed video data. The closedcaptioning text can be overlaid using alpha blended graphics. The methodcan also include rendering speaker-indicator graphics to provide avisual link between the closed captioning text displayed on the displayscreen and the associated speaker. The method can also include mixingthe rendered speaker-indicator graphics and the scaled video data priorto display on the display screen. The closed captioning data can furtherinclude a timing index to associate the closed captioning text with aspecific portion of the video data. The method can also include scalingthe video data such that the scaled video data is smaller than thedisplay screen on which the scaled video data is displayed. The positionof the closed captioning text can be outside of the scaled video data.The position of the closed captioning text can partially overlay thescaled video data. The portion of the closed captioning text thatoverlays the scaled video data can be overlaid using alpha blendedgraphics. The method can also include positioning the scaled video dataon the display screen. The closed captioning data can include speakerposition information that associates the closed captioning text to aposition of the associated speaker within the video data.

In another aspect of the present invention, a method of closedcaptioning includes receiving an audio/video signal including a videodata and a closed captioning data, scaling the video data such that thescaled video data is smaller than a display screen on which the scaledvideo data is displayed, positioning a closed captioning text accordingto position information included within the closed captioning data,wherein the closed captioning text is associated to a speaker within thedisplayed scaled video data and the position of the closed captioningtext is proximate the associated speaker. The position of the closedcaptioning text can be outside of the scaled video data. The position ofthe closed captioning text can partially overlay the scaled video data.The portion of the closed captioning text that overlays the scaled videodata can be overlaid using alpha blended graphics. The method can alsoinclude positioning the scaled video data on the display screen. Themethod can also include rendering speaker-indicator graphics to providea visual link between the closed captioning text displayed on thedisplay screen and the associated speaker. The method can also includemixing the rendered speaker-indicator graphics and the scaled video dataprior to display on the display screen. The closed captioning data canfurther include a timing index to associate the closed captioning textwith a specific portion of the video data. The closed captioning datacan include speaker position information that associates the closedcaptioning text to a position of the associated speaker within the videodata.

In yet another aspect of the present invention, a device to provideclosed captioning includes an input/output (I/O) interface circuit toreceive an audio/video signal, the audio/video signal including a videodata and a closed captioning data, a display including a viewable area,and a processing circuit coupled to the display and to the I/O interfacecircuit to receive the audio/video signal from the I/O interfacecircuit, scale the video data such that the scaled video data is smallerthan the viewable area, and position a closed captioning text accordingto position information included within the closed captioning data,wherein the scaled video data is displayed on a portion of the viewablearea, thereby forming a displayed video data, the closed captioning textis associated to a speaker within the displayed video data and theposition of the closed captioning text on the viewable area is proximatethe associated speaker. The device can comprise an interactivetelevision. The closed captioning data can include speaker positioninformation that associates the closed captioning text to a position ofthe associated speaker within the video data. The device can alsoinclude a decoder to decode the received audio/video signal. Theposition of the closed captioning text can be outside of the scaledvideo data. The position of the closed captioning text can partiallyoverlay the scaled video data. The portion of the closed captioning textthat overlays the scaled video data can be overlaid using alpha blendedgraphics. The processor can render speaker-indicator graphics to providea visual link between the closed captioning text displayed on theviewable area and the associated speaker. The processor can mix therendered speaker-indicator graphics and the scaled video data prior todisplay on the viewable area. The closed captioning data can furtherinclude a timing index to associate the closed captioning text with aspecific portion of the video data.

In still yet another aspect of the present invention, a television toprovide closed captioning includes an input/output (I/O) interfacecircuit to receive an audio/video signal, the audio/video signalincluding a video data and a closed captioning data, a display includinga viewable area, and a processing circuit coupled to the display and tothe I/O interface circuit to receive the audio/video signal from the I/Ointerface circuit, scale the video data such that the scaled video datais smaller than the viewable area, and position a closed captioning textaccording to position information included within the closed captioningdata, wherein the scaled video data is displayed on a portion of theviewable area, thereby forming a displayed video data, the closedcaptioning text is associated to a speaker within the displayed videodata and the position of the closed captioning text on the viewable areais proximate the associated speaker. The television can comprise aninteractive television. The closed captioning data can include speakerposition information that associates the closed captioning text to aposition of the associated speaker within the video data. The televisioncan also include a decoder to decode the received audio/video signal.The position of the closed captioning text can be outside of the scaledvideo data. The position of the closed captioning text can partiallyoverlay the scaled video data. The portion of the closed captioning textthat overlays the scaled video data can be overlaid using alpha blendedgraphics. The processor can render speaker-indicator graphics to providea visual link between the closed captioning text displayed on theviewable area and the associated speaker. The processor can mix therendered speaker-indicator graphics and the scaled video data prior todisplay on the viewable area. The closed captioning data can furtherinclude a timing index to associate the closed captioning text with aspecific portion of the video data.

In another aspect of the present invention, a network of devices toprovide closed captioning includes a display device including a viewablearea, and a decoder comprising an input/output (I/O) interface circuitto receive an audio/video signal, the audio/video signal including avideo data and a closed captioning data, and a processing circuitcoupled to the I/O interface circuit to receive the audio/video signal,scale the video data such that the scaled video data is smaller than theviewable area, and position a closed captioning text on the viewablearea according to position information included within the closedcaptioning data, wherein the scaled video data is sent to the displaydevice to be displayed on a portion of the viewable area, therebyforming a displayed video data, the closed captioning text is associatedto a speaker within the displayed video data and the position of theclosed captioning text on the viewable area is proximate the associatedspeaker. The display device can comprise an interactive television. Thedecoder can comprise a set top box. The closed captioning data caninclude speaker position information that associates the closedcaptioning text to a position of the associated speaker within the videodata. The decoder can also include a decoder circuit to decode thereceived audio/video signal. The position of the closed captioning textcan be outside of the scaled video data. The position of the closedcaptioning text can partially overlay the scaled video data. A portionof the closed captioning text that overlays the scaled video data can beoverlaid using alpha blended graphics. The processor can renderspeaker-indicator graphics to provide a visual link between the closedcaptioning text displayed on the viewable area and the associatedspeaker. The processor can mix the rendered speaker-indicator graphicsand the scaled video data prior to display on the viewable area. Theclosed captioning data can further include a timing index to associatethe closed captioning text with a specific portion of the video data.

In yet another aspect of the present invention, a method of generatingclosed captioning including adding a closed captioning data to anaudio/video signal, wherein the audio/video signal comprises a videodata and the closed captioning data, and the closed captioning dataincludes a closed captioning text and position information thatassociates the closed captioning text to a position within the videodata. The position within the video data can correspond to a speakerassociated with the closed captioning text. The closed captioning datacan further include a timing index to associate the closed captioningtext with a specific portion of the video data.

In still yet another aspect of the present invention, a closedcaptioning stream of data comprising a video data and a closedcaptioning data, wherein the closed captioning data includes a closedcaptioning text and positioning information that associates the closedcaptioning text to a position within the video data. The position withinthe video data can corresponds to a speaker associated with the closedcaptioning text. The closed captioning data can further include a timingindex to associate the closed captioning text with a specific portion ofthe video data.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates an exemplary network of devices.

FIG. 2 illustrates a block diagram of an exemplary decoder deviceaccording to the present invention.

FIG. 3 illustrates an exemplary implementation of closed captioningaccording to the present invention.

FIG. 4 illustrates a process of providing closed captioning according tothe preferred embodiment of the present invention.

DETAILED DESCRIPTION OF THE EMBODIMENTS

Embodiments of the present invention provide enhanced closed captioninginformation within an audio/video signal to a decoder device. Theaudio/video signal includes a video data and the closed captioninginformation. The closed captioning information includes a time index, aclosed captioning text, and positioning information. The positioninginformation is associated with the closed captioning text for a giventime index. The positioning information indicates a position within thevideo data. For example, a closed captioning text, caption 1, isassociated to a position (100, 100) within a given frame of video data.The position (100, 100) correlates to an (x, y) coordinate system, as iswell known in the art. Other means for conveying positioning informationcam also be used.

The position corresponds to a speaker, or actor, who is speaking theassociated closed captioning text. The decoder device decodes the closedcaptioning information to determine the position of the speaker withinthe video data, and the time index to correlate the closed captioningtext and positioning information to a specific frame of video data. Inconventional closed captioning, a text box is directly overlaid on thevideo and the contents of the text box normally provide only atranscription of the dialog.

In the present invention, the closed captioning information is enhancedto include information about the location of the speaker, and when thetext is spoken. Using the location of the speaker in the video and thetime index when the text is spoken, “talk bubbles” are added for theclosed captioned video. The talk bubbles provide a visual link betweenthe closed captioning text and the speaker. The video data is preferablyscaled to provide a less than full screen video. The scaled video isappropriately positioned on a display screen and the talk bubbles arepreferably displayed off the scaled video. An additional graphic can besuperimposed on the video that further links the talk bubble to thespeaker. Preferably, the additional graphic is a line, but any othergraphic can be used which provides a visual link from the talk bubble tothe speaker. In an alternative embodiment, the video is not scaled andthe talk bubbles are superimposed on the full screen video in a blendedfashion. Blending the talk bubbles is preferably performed using alphablended graphics. In another alternative embodiment, the video is scaledand the talk bubbles are partially superimposed and blended on thevideo, with a remaining portion of the talk bubbles displayed off thevideo. Embodiments of the present invention differ from conventionalimplementations through the use of scaled video, the use of blendedgraphics to display the closed captioning, and the use of visual linksbetween the speaker and the closed captioning text. Such animplementation minimizes the obstructions to the video and makes thedialog easier to follow.

FIG. 1 illustrates an exemplary network of devices including a displaydevice 10, a personal computer (PC) 20, and a broadband router (BBR) 30.The display device 10 preferably comprises a television with an embeddeddecoder device 110 (FIG. 2) which is discussed in greater detail below.In an alternative embodiment, the decoder device 110 comprises a standalone device, such as a set top box, which is coupled to a displaydevice, such as a television or the PC 20. In yet another alternativeembodiment, the decoder device 110 is embedded within the PC 20.Preferably, the display device 10, the BBR 30 and the PC 20 comprise ahome network.

The BBR 30 is preferably coupled to the Internet 40 via broadband, suchas DSL or cable modem. The BBR 30 is preferably coupled to a cableservice provider 50 via a conventional cable network and/or coupled to asatellite service provider 50 via a conventional satellite network.Alternatively, the display device 10 and/or the PC 20 are coupleddirectly to the Internet 40 and/or the cable/satellite service provider50.

The configuration illustrated in FIG. 1 is exemplary only. It should beapparent that an audio/video network could include many differentcombinations of components. It should also be apparent that the homenetwork can be of any conventional type, including but not limited toethernet, IEEE 1394-2000, or wireless. Network connections external tothe home network can be of any conventional type sufficient to provide aconnection to a remote content source, including but not limited to thepublic switched telephone network, cable network, and satellite network.

FIG. 2 illustrates an exemplary decoder device 110 according to thepresent invention. The decoder device 110 preferably controls thetransmission of audio/video signals from a local storage device, such asa personal computer (PC), to a display or from a remote contentprovider, such as a cable/satellite television provider, to the display.The decoder device 110 includes an input/output (I/O) interface 120, asecondary memory 130, a system memory 140, a central processing unit(CPU) 150, and a decoder 160 all coupled via a bi-directional bus 170.The I/O interface 120 preferably couples the display device 10 to acontent source (not shown) for receiving audio/video signals. The I/Ointerface 120 can also be coupled to a conventional network, such as theInternet, to receive periodic software upgrades including new versionsof operating software and new or upgraded applications. In analternative embodiment, the decoder device 110 includes a userinterface. This alternative embodiment is commonly used when the decoderdevice is used as a stand-alone device. In the alternative embodiment,the I/O interface 120 can send and receive control signals to and fromthe user interface and the display device 10. The user interface cancomprise a keypad and display, as is well known in the art.Alternatively, the user interface can comprise any conventional userinterface.

The secondary memory 130 stores the software used to enable operation ofthe decoder device 110 along with a plurality of applications. Exemplaryapplications include, but are not limited to a menu of available contentsuch as an on-screen television guide, and display parameter settingssuch as color, tint, and brightness. Preferably, the secondary memory130 is flash memory. Alternatively, any conventional type of memory canbe used. Preferably, the system memory 140 includes random access memory(RAM). The system memory 140 can also include additional buffers,registers, and cache according to specific design implementations.Audio/video signals received by the display device 10 are preferablyencrypted to prevent unauthorized access and use, and the decoder 160decrypts the audio/video signal according to access authorizationprovided by the CPU 150.

The decoder device 110 decodes the audio/video signal to obtain theclosed captioning text, the time index, the positioning information, andthe video data. The video data is appropriately scaled, and graphics aregenerated to form the talk bubbles and visual link to the speaker. Thescaled video and the graphics are mixed and appropriately positioned onthe display screen.

FIG. 3 illustrates an exemplary implementation of closed captioningaccording to the present invention. The display device 10 (FIG. 1)includes a display screen 200 to display a scaled video 210, talkbubbles 220 and 240, and graphic links 230 and 250. The scaled video 210includes actor 1 and actor 2. A closed captioning text, caption 1,associated with actor 1 is included within the talk bubble 220. The talkbubble 220 is visually linked to actor 1 via the graphic link 230. Thetalk bubble 220 is preferably positioned outside of the scaled video 210and proximate actor 1. A portion of the graphic link 230 which issuperimposed on the scaled video 210 is blended, preferably through theuse of alpha blended graphics. A closed captioning text, caption 2,associated with actor 2 is included within the talk bubble 240. The talkbubble 240 is visually linked to actor 2 via the graphic link 250. Thetalk bubble 240 is preferably positioned outside of the scaled video 210and proximate actor 2. A portion of the graphic link 250 which issuperimposed on the scaled video 210 is blended, preferably using alphablended graphics. In this manner, caption 1 and caption 2 do not blockany of the scaled video 210. Also, the position of caption 1 proximateactor 1 and the position of caption 2 proximate actor 2 makes the dialogof actor 1 and actor 2 easier to follow. The graphic links 230 and 250further enhance the ease with which caption 1 and caption 2 areassociated with actor 1 and actor 2, respectively.

The closed captioning implementation illustrated in FIG. 3 demonstratesthe preferred embodiment in which the video data is scaled and theclosed captioning text is displayed off the scaled video. In analternative embodiment, the video data is not scaled. In thisalternative case, caption 1, caption 2, talk bubble 220, talk bubble240, graphics link 230 and graphics link 250 are blended with the videodata, preferably using alpha blended graphics. The caption 1, caption 2,talk bubble 220, talk bubble 240, graphic link 230, and graphic link 250can remain positioned in the same locations as illustrated in FIG. 3, orthe positions can be farther refined to be more proximate actor 1 andactor 2 as appropriate. Since in the alternative embodiment it is not aconcern that the talk bubble 220 and 240 do not overlay the video data,positioning the talk bubbles 220 and 240 closer to their respectivespeakers, actor 1 and actor 2, may be possible. Further, if the positionof the talk bubble 220 and/or talk bubble 240 are within a predetermineddistance of their associated speakers, then there may not be a need foreither the graphics link 230 or the graphics link 250.

FIG. 4 illustrates a process of providing closed captioning according tothe preferred embodiment of the present invention. At the step 310, anaudio/video signal is received by a decoder device. At the step 320, thedecoder device decodes and extracts video data and closed captioningdata from the audio/video signal. The closed captioning data includes aclosed captioning text, a timing index, and a positioning information.At the step 330, the video data is appropriately scaled to a sizesmaller than a full-screen size of a viewing area of a display. At thestep 340, a position of the scaled video on the viewable area isdetermined Preferably, the scaled video is centered within the viewablearea. At the step 350, a position of the closed captioning text on theviewable area is determined. To determine the position of the closedcaptioning text, the extracted positioning information is used. Thepositioning information indicates a position within the video data, theposition corresponds to a speaker who is speaking the correspondingclosed captioning text at a given time index. The position within thevideo data is correlated to a position within the scaled video. Aposition for a talk bubble is then determined such that the talk bubbleposition is outside the scaled video yet proximate the associatedspeaker. The talk bubble includes the closed captioning text, therebydetermining the position of the closed captioning text. At the step 360,graphics are rendered to associate the closed captioning text to thecorresponding speaker within the scaled video. The graphics include thetalk bubble and a graphics link which preferably extends from the talkbubble to the speaker. As such, the graphics link partially overlays thescaled video. The portion of the graphics link that partially overlaysthe scaled video is preferably blended using alpha blended graphics. Atthe step 370, the rendered graphics, the closed captioning text and thescaled video are mixed and displayed on the viewable area.

Although the scaled video is preferably centered within the viewablearea, the position of the scaled video can be positioned to accommodatepositioning the closed captioning text proximate the associated speakerand outside of the scaled video. Additionally, although the presentinvention is described in regards to decoding an audio/video signal, thepresent invention can also be applied to a video signal absent an audiocomponent.

In operation, the closed captioning system of the present inventionreceives an audio/video signal that includes video data and closedcaptioning information. The audio/video signal, including video data andclosed captioning information, can be received from any availablesource, including a broadcast source or a pre-recorded source, such as avideo tape, DVD or CD-Rom type source. The video data is the video to bedisplayed and the closed captioning information includes closedcaptioning text, a timing index, and positioning information. Theaudio/video signal is decoded and the video data and the closedcaptioning information are extracted. Preferably, the video data isappropriately scaled to a size smaller than a full screen viewable areaof a display. A position of the closed captioning text is determinedoutside the scaled video, using the positioning information provided bythe audio/video signal. The positioning information includes coordinatesof a speaker within the video data to which the closed captioning textcorresponds. The timing index is used to properly correlate the closedcaptioning text and its positioning information to a specific portion ofthe video data. Using the coordinates of the speaker, the position ofthe closed captioning text is determined such that the closed captioningtext is proximate the speaker yet outside the scaled video. A talkbubble graphic is rendered to surround the closed captioning text.Preferably, the entire talk bubble is outside the scaled video and anadditional graphic, a graphic link, is added to visually link the talkbubble to the speaker. A portion of the graphic link overlays the scaledvideo. The graphic link is preferably a line, and the portion of thegraphic link that overlays the scaled video is preferably blended withthe scaled video. The closed captioning text, the rendered graphics andthe scaled video are then mixed and displayed on the viewable area ofthe display.

The present invention has been described in terms of specificembodiments incorporating details to facilitate the understanding of theprinciples of construction and operation of the invention. Suchreferences, herein, to specific embodiments and details thereof are notintended to limit the scope of the claims appended hereto. It will beapparent to those skilled in the art that modifications can be made inthe embodiments chosen for illustration without departing from thespirit And scope of the invention. Specifically, it will be apparent toone of ordinary skill in the art that while the preferred embodiment ofthe present invention is used with set-top boxes, the present inventioncan also be implemented on any other appropriate system resource limiteddevice.

1. A method of closed captioning, the method comprising: a. receiving anaudio/video signal including a video data and a closed captioning data,wherein the closed captioning data comprises position information basedon a speaker position of a speaker within the video data; b. determininga position of a closed captioning text according to the positioninformation; and c. displaying the video data on a display screen anddisplaying the closed captioning text on the display screen according tothe determined position such that a plurality of words from the closedcaptioning text are visible on the display screen concurrently.
 2. Themethod of claim 1 wherein the closed captioning text overlays thedisplayed video data.
 3. The method of claim 2 wherein the closedcaptioning text is overlaid using alpha blended graphics. 4-5.(canceled)
 6. The method of claim 1 wherein the closed captioning datafurther includes a timing index to associate the closed captioning textwith a specific portion of the video data. 7-11. (canceled)
 12. Themethod of claim 1 wherein the closed captioning data includes speakerposition information that associates the closed captioning text to thespeaker position of the associated speaker within the video data.
 13. Amethod of closed captioning, the method comprising: a. receiving anaudio/video signal including a video data and a closed captioning data,wherein the closed captioning data comprises position information basedon a speaker position of a speaker within the video data; b. displayingvideo corresponding to the video data; and c. displaying closedcaptioning text included within the closed captioning data bypositioning the closed captioning text according to the positioninformation such that a plurality of words from the closed captioningtext are visible concurrently.
 14. The method of claim 13 wherein theposition of the closed captioning text is outside of the displayedvideo.
 15. The method of claim 13 wherein the position of the closedcaptioning text partially overlays the displayed video.
 16. The methodof claim 15 wherein the portion of the closed captioning text thatoverlays the displayed video is overlaid using alpha blended graphics.17. The method of claim 13 further comprising positioning the displayedvideo on the display screen. 18-19. (canceled)
 20. The method of claim13 wherein the closed captioning data further includes a timing index toassociate the closed captioning text with a specific portion of thevideo data.
 21. The method of claim 13 wherein the closed captioningdata includes speaker position information that associates the closedcaptioning text to the position of the associated speaker within thevideo data.
 22. A device to provide closed captioning, the devicecomprising: a. an input/output (I/O) interface circuit to receive anaudio/video signal, the audio/video signal including a video data and aclosed captioning data, wherein the closed captioning data comprisesposition information based on a speaker position of a speaker within thevideo data; b. a display including a viewable area for displaying videocorresponding to the video data; and c. a processing circuit coupled tothe display and to the I/O interface circuit to receive the audio/videosignal from the I/O interface circuit, and position a closed captioningtext according to the position information such that a plurality ofwords from the closed captioning text are visible on the displayconcurrently.
 23. The device of claim 22 wherein the device comprises aninteractive television.
 24. The device of claim 22 wherein the closedcaptioning data includes speaker position information that associatesthe closed captioning text to the position of the associated speakerwithin the video data.
 25. (canceled)
 26. The device of claim 22 whereinthe position of the closed captioning text is outside of the displayedvideo.
 27. The device of claim 22 wherein the position of the closedcaptioning text partially overlays the displayed video.
 28. The deviceof claim 27 wherein the portion of the closed captioning text thatoverlays the displayed video is overlaid using alpha blended graphics.29-30. (canceled)
 31. The device of claim 22 wherein the closedcaptioning data further includes a timing index to associate the closedcaptioning text with a specific portion of the video data. 32-64.(canceled)
 65. A method of generating closed captioning including addinga closed captioning data to an audio/video signal, wherein theaudio/video signal comprises a video data and the closed captioningdata, and the closed captioning data includes a closed captioning textand position information that associates the closed captioning text to aposition within the video data based on a speaker position of a speakerwithin the video data such that when displayed a plurality of words fromthe closed captioning text are visible concurrently.
 66. The method ofclaim 65 wherein the speaker is associated with the closed captioningtext.
 67. The method of claim 65 wherein the closed captioning datafurther includes a timing index to associate the closed captioning textwith a specific portion of the video data.
 68. A closed captioningstream of data stored on a memory device comprising a video data and aclosed captioning data, wherein the closed captioning data includes aclosed captioning text and positioning information that associates theclosed captioning text to a position within the video data based on aspeaker position of a speaker within the video data such that whendisplayed a plurality of words from the closed captioning text arevisible concurrently.
 69. The closed captioning stream of data of claim68 wherein the speaker is associated with the closed captioning text.70. The closed captioning stream of data of claim 68 wherein the closedcaptioning data further includes a timing index to associate the closedcaptioning text with a specific portion of the video data.